Exploring fusion strategies for accurate RGBT visual object tracking

نویسندگان

چکیده

We address the problem of multi-modal object tracking in video and explore various options available for fusing complementary information conveyed by visible (RGB) thermal infrared (TIR) modalities, including pixel-level, feature-level decision-level fusion. Specifically, contrast to existing approaches, we propose develop paradigm combining image fusion at pixel level. At feature level, two different kinds strategies are investigated completeness, i.e., attention-based online strategy offline-trained block. decision a novel is put forward, inspired success simple averaging configuration which has shown so much promise. The effectiveness proposed owes number innovative contributions, dynamic weighting RGB TIR contributions linear template update operation. A variant method produced winning tracker Visual Object Tracking Challenge 2020 (VOT-RGBT2020). comprehensive comparison with highlights advantages multimodal score Extensive experimental results on five challenging datasets, GTOT, VOT-RGBT2019, RGBT234, LasHeR VOT-RGBT2020, demonstrate robustness method, compared state-of-the-art approaches. Code https://github.com/Zhangyong-Tang/DFAT.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fusion of Multimodal Visual Cues for Model-Based Object Tracking

While many robotic applications rely on visual tracking, conventional single-cue algorithms typically fail outside limited tracking conditions. Fusion of multimodal visual cues with complementary failure modes allows tracking to continue despite losing individual cues. While previous applications have addressed multi-cue 2D featurebased tracking, this paper develops a fusion scheme for 3D model...

متن کامل

Convolutional Gating Network for Object Tracking

Object tracking through multiple cameras is a popular research topic in security and surveillance systems especially when human objects are the target. However, occlusion is one of the challenging problems for the tracking process. This paper proposes a multiple-camera-based cooperative tracking method to overcome the occlusion problem.  The paper presents a new model for combining convolutiona...

متن کامل

Accurate Scale Estimation for Robust Visual Tracking

Robust scale estimation is a challenging problem in visual object tracking. Most existing methods fail to handle large scale variations in complex image sequences. This paper presents a novel approach for robust scale estimation in a tracking-by-detection framework. The proposed approach works by learning discriminative correlation filters based on a scale pyramid representation. We learn separ...

متن کامل

Distributed Sensor Fusion for Object Tracking

In a dynamic situation like robot soccer any individual player can only observe a limited portion of their environment at any given time. As such to develop strategies based upon planning and cooperation between different players it is imperative that they be able to share information which may or may not be in any individual player’s field of vision. In this paper we propose a method for multi...

متن کامل

Group Tracking: Exploring Mutual Relations for Multiple Object Tracking

In this paper, we propose to track multiple previously unseen objects in unconstrained scenes. Instead of considering objects individually, we model objects in mutual context with each other to benefit robust and accurate tracking. We introduce a unified framework to combine both Individual Object Models (IOMs) and Mutual Relation Models (MRMs). The MRMs consist of three components, the relatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Information Fusion

سال: 2023

ISSN: ['1566-2535', '1872-6305']

DOI: https://doi.org/10.1016/j.inffus.2023.101881